DDS: integrating data analytics transformations in task-based workflows

نویسندگان

چکیده

High-performance data analytics (HPDA) is a current trend in e-science research that aims to integrate traditional HPC with recent analytic frameworks. Most of the work done this field has focused on improving frameworks by implementing their engines top technologies such as Message Passing Interface. However, there lack integration from an application development perspective. workflows have own parallel programming models, while (DA) algorithms are mainly implemented using transformations and executed like Spark. Task-based models (TBPMs) very efficient approach for workflows. Data can also be decomposed set tasks task-based model. In paper, we present methodology develop HPDA applications TBPMs allow developers combine seamlessly. A prototype been PyCOMPSs model validate two aspects: seamlessly developed better performance than We compare our results different programs. Finally, conclude idea integrating DA into evaluation method against Spark.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recalibration of Analytics Workflows

As business decisions and strategies become more and more automated, real-time, and data-driven, enterprises need to create, manage and execute end-to-end analytics workflows that process increasing data volumes, from new heterogeneous data sources, on specialized processing engines. Workflows become more complex and time-consuming to design and execute, since they span a variety of systems and...

متن کامل

A Cloud Framework for Big Data Analytics Workflows on Azure

Since digital data repositories are more and more massive and distributed, we need smart data analysis techniques and scalable architectures to extract useful information from them in reduced time. Cloud computing infrastructures offer an effective support for addressing both the computational and data storage needs of big data mining applications. In fact, complex data mining tasks involve dat...

متن کامل

Reactive workflows for visual analytics

The increasing amounts of electronic data of all forms, produced by humans (e.g. Web pages, structured content such as Wikipedia or the blogosphere etc.) and/or automatic tools (loggers, sensors, Web services, scientific tools etc.) leads to a situation of unprecedented potential for extracting new knowledge, finding new correlations etc. Typically, such analysis is performed by using data visu...

متن کامل

Collaborative Data-centric Workflows: Towards Knowledge centric workflows and Integrating Uncertain Data

The acquisition of data, in particular for scientific data, is more and more organized in complex processes that are captured by workflows. These workflows are often driven by ontologies. For example the collaborative application Spipoll [3] proposes to collect information about pollination in France. The users take pictures of insects on flowers, download them on the application and then ident...

متن کامل

Data Analytics Applied to Chemical Transformations in Liquids

Elucidating the fundamental mechanisms of nanocrystal growth necessitates the utilization of high spatial resolution imaging techniques that are capable of directly imaging individual nucleation and growth events within a liquid phase. By combining time-resolved imaging datasets with quantitative image analysis algorithms, the factors controlling chemical transformations can be determined by an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Open research Europe

سال: 2023

ISSN: ['2732-5121']

DOI: https://doi.org/10.12688/openreseurope.14569.2